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Abstract. An average-time game is played on the infinite graph of con- 
figurations of a finite timed automaton. The two players, Min and Max, 
construct an infinite run of the automaton by taking turns to perform 
a timed transition. Player Min wants to minimise the average time per 
transition and player Max wants to maximise it. A solution of average- 
time games is presented using a reduction to average-price game on a 
finite graph. A direct consequence is an elementary proof of determinacy 
for average-time games. This complements our results for reachability- 
time games and partially solves a problem posed by Bouyer et al., to 
design an algorithm for solving average-price games on priced timed au- 
tomata. The paper also establishes the exact computational complexity 
of solving average-time games: the problem is EXPTIME-complete for 
timed automata with at least two clocks. 



1 Introduction 

Real-time open systems are computational systems that interact with environ- 
ment and whose correctness depends critically on the time at which they perform 
some of their actions. The problem of design and verification of such systems can 
be formulated as two-player zero-sum games. A heart pacemaker is an example 
of a real-time open system as it interacts with the environment (heart, body 
movements, and breathing) and its correctness depends critically on the time 
at which it performs some of its actions (sending pace signals to the heart in 
real time). Other examples of safety-critical real-time open systems include nu- 
clear reactor protective systems, industrial process controllers, aircraft-landing 
scheduling systems, satellite-launching systems, etc. Designing correct real-time 
systems is of paramount importance. Timed automata [2] are a popular and 
well-established formalism for modelling real-time systems, and games on timed 
automata can be used to model real-time open systems. In this paper, we in- 
troduce average-time games which model the interaction between the real-time 
open system and the environment; and we are interested in finding a strategy 
of the system which results in minimum average-time per transition, assuming 
adversarial environment. 

Related Work. Games with quantitative payoffs can be studied as a model 
for optimal-controller synthesis [31116] . Among various quantitative payoffs the 
average-price payoff |9I8| is the most well-studied in game theory, Markov de- 
cision processes, and planning literature |8ll6j . and it has numerous appealing 



interpretations in applications. Most algorithms for solving Markov decision pro- 
cesses [TB] or games with average-price payoff work for finite graphs only [1818] . 
Asarin and Maler [3] presented the first algorithm for games on timed automata 
(timed games) with a quantitative payoff: reachability-time payoff. Their work 
was later generalised by Alur et al. f and Bouycr et al. 6 to give partial de- 
cidability results for reachability-price games on linearly-priced timed automata. 
The exact computational complexity of deciding the value in timed games with 
reachability-time payoff was shown to be EXPTIME in [1117] . Bouyer et al. [5] 
also studied the more difficult average-price payoffs, but only in the context of 
scheduling, which in game-theoretic terminology corresponds to f-player games. 
They left open the problem of proving decidability of 2-player average-reward 
games on linearly-priced timed automata. We have recently extended the results 
of Bouyer et al. to solve f-player games on more general concavely-priced timed 
automata [12j . In this paper we address the important and non-trivial special 
case of average-time games (i.e., all locations have unit costs), which was also 
left open by Bouyer et al. 

Our Contributions. Average-time games on timed automata are introduced. 
This paper gives an elementary proof of determinacy for these games. A new 
type of region [5] based abstraction — boundary region graph — is defined, which 
generalises the corner-point abstraction of Bouyer et al. [S] . Our solution allows 
computing the value of average-time games for an arbitrary starting state (i.e., 
including non-corner states). Finally, we establish the exact complexity of solving 
average-time games: the problem is EXPTIME-complete for timed automata 
with at least two clocks. 

Organisation of the Paper. In Section[2]we discuss average-price games (also 
known as mean-payoff games) on finite graphs and cite some important results for 
these games. In Section [3] we introduce average-time games on timed automata. 
In Section |4] we introduce some region-based abstractions of timed automata, 
including the closed region graph, and its subgraphs: the boundary region graph, 
and the region graph. While the region graph is semantically equivalent to the 
corresponding timed automaton, the boundary region graph has the property 
that for every starting state, the reachable state space is finite. We introduce 
average-time games on these graphs and in Section [S] we show that if we have 
the solution of the average-time game for any of these graphs, then we get 
the solution of the average-time game for the corresponding timed automaton. 
Finally, in Section[7]we discuss the computational complexity of solving average- 
time games. 

Notations. We assume that, wherever appropriate, sets Z of integers, N of 
non- negative integers and K of reals contain a maximum element oo, and we 
write N+ for the set of positive integers and K© for the set of non-negative 
reals. For n £ N, we write l\n\)m for the set {0, 1, . . . ,n}, and (\n\jM for the set 
{r S M : < r < n} of non-negative reals bounded by n. For a real number 
r G K, we write |r| for its absolute value, we write [rj for its integer part, i.e., 
the largest integer n G N, such that n < r, and we write ^rj for its fractional 
part, i.e., we have |r j = ?- — [rJ . 



2 Average-Price Games 



A (perfect-information) two-player average-price game [1818] (also known as 
mean-payofff game) F — (V, E, Vuax, VuimP) consists of a finite directed graph 
(F, -B), a partition V ~ X^iax U Vuin of vertices, and a price function n : E ^ T,. 
A play starts at a vertex vq G V . li vq G V^, for p e { Max, Min }, then 
player p chooses a successor of the current vertex vq, i.e., a vertex ui, such 
that [vq.vi) e E, and ?;i becomes the new current vertex. When this happens 
then we say that player p has made a move from the current vertex. Play- 
ers keep making moves in this way indefinitely, thus forming an infinite path 
r — (fo, vi,V2t ■ ■) in the game graph. The goal of player Min is to minimise 
^Min('") = liinsup„^f^(l/n) • ''^{'^i-ii '^i) s-iid the goal of player Max is to 

maximise -4Max(?') = liminf„^oo(l/n) • X)"=i T^i'^i-i^'^i)- 

Strategies for players are defined as usual |18l8j . We write i^Min (-S'lviax) for the 
set of strategies of player Min (Max) and Uyiin (-?7Max) for the set of positional 
strategies of player Min (Max). For strategies fi S Syiin and x G -S'Max, and for 
an initial vertex u e 1^, we write run(w, fx, x) for the unique path formed if players 
start in the vertex v and then they follow strategies /z and x, respectively. For 
brevity, we write Aminiv, x) for .4Min(run(w, ^, x)) and we write ylMax(w, x) 
for ^Max(run(u,/z,x)). 

For a vertex u e T^, we define the upper value as 

\i^\{v) ^ inf sup ylMin(w,At,x), 

and the lower value as 

yaK?;) = sup inf ^Max(w, Ai, x)- 

Note that the inequality yai(i;) < val(i>) always holds. A game is determined if 
for every v &V , we have ya|(t;) — val(w). We then write val(w) for this number 
and we call it the value of the average-price game at the vertex v. 

We say that the strategies /i* G Syiin and %* £ X'Max are optimal for the re- 
spective players, if for every vertex w S F, we have that sup^g^^^^^ ^Min(v, M* , x) = 
val(i') and inf^g^^j^^^ ^Min(w, M*: x) = yal(?;). Liggett and Lippman [13] show that 
all perfect-information (stochastic) average-price games are positionally deter- 
mined. 

Theorem 1. Every average-price game is determined, and optimal posi- 
tional strategies exist for both players, i.e., for all v ^ V , we have: 

inf sup ^Mm(w,MiX)= sup inf AMax{v, tJ.,x)- 

The decision problem for average- price games is in NP n co-NP; no polynomial- 
time algorithm is currently known for the problem. 



3 Average-Time Games 



3.1 Timed Automata 

Before we present the syntax of the timed automata, we need to introduce some 
concepts. Fix a constant k € N for the rest of this paper. Let C be a finite 
set of clocks. Clocks in timed automata are usually allowed to take arbitrary 
non-negative real values. For the sake of simplicity and w.l.o.g [1], we restrict 
them to be bounded by some constant k, i.e., we consider only bounded timed 
automata models. A (fc-bounded) clock valuation is a function v : C —> (IfeDg; we 
write V for the set [C dfcDa] of clock valuations. If G V and t G then 
we write ly + t for the clock valuation defined by {v + t)(c) — i/(c) + t, for all 
c G C. For a set C" C C of clocks and a clock valuation ly : C , we define 

reset(i^, C")(c) = if c G C", and reset(i^, C")(c) — ^{c) if c ^ C". A corner is an 
integer clock valuation, i.e., a is a corner if a(c) G (|fcDN, for every clock c G C. 

The set of clock constraints over the set of clocks C is the set of conjunctions 
of simple clock constraints^ which are constraints of the form cixiiorc — c'cxii, 
where c,c' £ C, i e ^k\)n, and ixi G {<,>,=,<,> }. There are finitely many 
simple clock constraints. For every clock valuation G V, let SCC(;^) be the set 
of simple clock constraints which hold in G V. A clock region is a maximal 
set P C V, such that for all i/jv' G P, SCC(i^) = SCC(i^'). In other words, 
every clock region is an equivalence class of the indistinguishability-by-clock- 
constraints relation, and vice versa. Note that v and v' are in the same clock 
region iff all clocks have the same integer parts in v and v', and if the partial 
orders of the clocks, determined by their fractional parts in ly and v', are the 
same. For all G V, we write [v] for the clock region of i^. A clock zone is a 
convex set of clock valuations, which is a union of a set of clock regions. Note 
that a set of clock valuations is a zone iff it is definable by a clock constraint. 
For W CV, we write clos(H^) for the smallest closed set in V which contains W. 
Observe that for every clock zone W, the set clos(H^) is also a clock zone. 

Let L be a finite set of locations. A configuration is a pair (£, j/), where £ d L is 
a location and G V is a clock valuation; we write Q for the set of configurations. 
If s = {£, ly) d Q and c G C, then we write s(c) for v{c). A region is a pair {£, P), 
where ^ is a location and P is a clock region. If s = {£, v) is a configuration then 
we write [s] for the region {£, [v]). We write TZ for the set of regions. A set Z C Q 
is a zone if for every £ & L, there is a clock zone (possibly empty), such that 
Z = {{£,v) : £ e L and v G We}. For a region R = (£, P) G Tl, we write clos(i?) 
for the zone {{£^1') ■ v G clos(P)}. 

A timed automaton T = (L,C, S, A, E,S, g) consists of a finite set of loca- 
tions L, a finite set of clocks C, a set of states 5* C Q, a finite set of actions A, 
an action enabledness function E : A ^ 2"^, a transition function S : L x A L, 
and a clock reset function g : A 2^ . We require that S, and E{a) for all a E A, 
are zones. 

Clock zones, from which zones 5, and E{a), for all a G A, are built, are 
typically specified by clock constraints. Therefore, when we consider a timed 
automaton as an input of an algorithm, its size should be understood as the sum 



of sizes of encodings of L, C, A, 5, and q, and the sizes of encodings of clock 
constraints defining zones iS, and E(a), for all a E A. Our definition of a timed 
automaton may appear to differ from the usual ones [214] . but the differences 
are superficial. 

For a configuration s = (£, i/) G Q and t G , we define s + t to he the 
configuration s' = (£,v -\- 1) ii v + 1 € V, and we then write s We write 

s — *t s' if s s' and for all t' € [0,t], we have {£,1/ + t') G 5*. For an action 
a G A, we define succ(s, a) to be the configuration s' — [£' , v'\ where I' — 6(£, a) 
and ly' = reset(z^, g{a)), and we then write s s' . We write s s' if s s'; 
s, s' G S; and s G E{a). For technical convenience, and without loss of generality, 
we will assume throughout that for every s E S, there exists a € A, such that 
s — > s' . For s, s' G S*, we say that s' is in the future of s, or equivalently, that s 
is in the past of s', if there is t G M®, such that s -^t s'; we then write s — >* s'. 

For i?, R' G 7?., we say that R' is in the future of R, or that R is in the past 
of R', if for all s G -R, there is s' G R' , such that s' is in the future of s; we then 
write R — R'. Similarly, for R, R' G 7?., we write R ^ R' ii there is s G i?, and 
there is s' E R', such that s s'. 

A timed action is a pair r = (i, a) G M® x A. For s G Q, we define succ(s, r) = 
succ(s, {t, a)) to be the configuration s' = succ(s + t, a), i.e., such that s s" 
s', and we then write s -^t s' . We write s s' if s s" s', and we then 
say that (s, (i, a),s') is a transition of the timed automaton. If r = (t,a) then 
we write s ^ s' instead of s -^t s', and s s' instead of s -^t s' . 

An infinite run of a timed automaton is a sequence r = (sq, ti, si, T2, . . .), 
such that for all i > 1, we have Si-i s^. A finite run of a timed automaton is 
a finite sequence (sq, ti, si, T2, . . . , t„, s„} G S' x ((A x M®) x 5)*, such that for all 
«5 1 < * < fT-, we have Si_i ^ s^. For a finite run r = (sq, ti, si, r2, . . . , r„, s„), 
we define length(r) — n, and we define last(r) = s„ to be the state in which the 
run ends. For a finite run r — {sq, ti, si, T2, . . . , s„), we define time of the run as 
time(r) = J2^=i ^i- We write RunSfin for the set of finite runs. 

3.2 Strategies 

An average-time game is a triple (T, Lmih, ^Max), where T = {L, C, A, E, 6, g) 
is a timed automaton and (Liviin, -^^Max) is a partition of L. We define Quin ~ 

{{£, v) : £ & ^Min}, QMax = Q \ Quin, Suin = S Cl Quin, 'S'lVIax — S \ Suin, 

"^Min = {[s] : S G Quin}, and 7?.Max = 7?. \ T^Min- 

A strategy for Min is a function /i : Runsgn — > A x K®, such that if last(r) = 
s G iSMin and ii{r) = r then s ^ s', where s' — succ(s, r). Similarly, a strategy 
for player Max is a function x ■ RunSfin Ax R® , such that if last(r ) = s G ^Max 
and x(r) = r then s ^ s', where s' = succ(s,t). We write Uuin for the set of 
strategies for player Min, and we write Suax for the set of strategies for player 
Max. If players Min and Max use strategies /z and x, resp., then the (/x,x)-run 
from a state s is the unique run run(s, /i, x) = (so, ti, si, T2, . . .}, such that sq = s, 
and for every i > f, if G ^Min, or Si G -^Max, then ^(runj(s, ^, x)) = n+i, or 
X(runi(s, /x,x)) = r^+i, resp., where runi(s,/x,x) = (so,ri,si, . . . , Si-i,Ti, Si) . 



We say that a strategy /i for Min is positional if for all finite runs r, r' G 
Runsfin, we have that last(r) — last(r') implies /i(r) = fJ-{r'). A positional strategy 
for player Min can be then represented as a function fi : Suin Ax Mg, which 
uniquely determines the strategy G Suin as follows: /i°°(r) ~ /i(last(r)), 
for all finite runs r G RunSfin- Positional strategies for player Max are defined 
and represented in the analogous way. We write TTMin and iliviax for the sets of 
positional strategies for player Min and for player Max, respectively. 

3.3 Value of Average-Time Game 

If player Min uses the strategy ^ G ^Min and player Max uses the strategy 
X G Smux then player Min loses the value 

^Min(s,A*,x) = limsup- • time(run„(s,^,x)), 

n — >oo ^ 

and player Max wins the value 

Auaxis, fi,x) = liminf- • time(run„(s, ^, x))- 

n— >oo 71 

In an average-time game player Min is interested in minimising the value she 
loses and player Max is interested in maximising the value he wins. For every 
state s G of a timed automaton, we define its upper value by 

val^(s) = inf sup Auinis, fi,x), 

and its lower value 

yal^(s) = sup inf ^Max(s,MiX)- 

The inequality ya['^(s) < val'^(s) always holds. An average-time game is 
determined if for every state s G S*, its lower and upper values are equal to each 
other; then we say that the value val^(s) exists and val^(s) = val^(s) = val (s). 
We give an elementary proof for the determinacy of the average-time games 
without recourse to general results like Martin's determinacy theorem |14I15| . 

Theorem 2 (Determinacy). Average-time games are determined. 

For strategy /i G ^Min of player Min and x G ^Ma,K of player Max, we define 
val^(s) = sup^g^j^^.^^ ^Min(s,M,x), and val^(s) = inf^ei;„j„ ^Max(s, x). For an 
£ > 0, we say that a strategy /i G ^Min or x G -S'lviax is e- optimal if for every s G S 
we have that val'^(s) < val'^(s) -I- e or val^(s) > val'^(s) — £, respectively. Note 
that if a game is determined then for every e > 0, both players have e-optimal 
strategies. 

We say that a strategy x & ^Uax of player Max is a best response to a 
strategy /z G SMin of player Min if for all s G iS we have that -4Min(s, /x, x) = 



sup^/g^;^^,^ -^Min(s, /i, x')- Similarly we say that a strategy ^ e ^Min of player 
Min is a best response to a strategy x G ^Max of player Max if for all s G S* we 

have that ^Max(s,MiX) = inf^'ei^Min -^Max(s, m', x)- 

In the next section we introduce some region-based abstractions of timed 
automata, including the closed region graph, and its subgraphs: the boundary 
region graph, and the region graph. While the region graph is semantically equiv- 
alent to the corresponding timed automaton, the boundary region graph has the 
property that for every starting state, the reachable state space is finite. In Sec- 
tion [5] we introduce average-time games on these graphs and show that if we 
have the solution of the average-time game for any of these graphs, then we get 
the solution of the average-time game for the corresponding timed automaton. 
The key Theorem [5] follows immediately from Theorem |31 

4 Abstractions of Timed Automata 

The region automaton, originally proposed by Alur and Dill [2], is a useful ab- 
straction of a timed automaton as it preserves the validity of qualitative reach- 
ability, safety, and w-regular properties. The region automaton [5] RA(T) = 
{TZ, A^) of a timed automaton T consists of: 

— the set TZ of regions of T, and 

- M QTl X {n X A) X n, such that for all a e A, and for aU i?, R' , R" e 7^, 
we have that {R, R" , a, R') e X ilf i? R" ^ R' . 

The region automaton, however, is not sufficient for solving average-time 
games as it abstracts away the timing information. Corner-point abstraction, 
introduced by Bouyer et al. [5], is a refinement of region automaton which pre- 
serves some timing information. Formally, the corner-point abstraction CP(T) 
of a timed automaton T is a finite graph (F, E) such that: 

— V Q X TZ such that (s, R) ^ V iS. s — {(., v) e clos(i?) and v \s & cor- 
ner. Since timed automata we consider are bounded, there are finitely many 
regions, and every region has a finite number of corners. Hence the set of 
vertices finite. 

- E <ZV x{^(^xnx A)xV such that for (s, i?), (s', R') e V and (t, i?", a) G 
Me X 7^ X A, we have {{s, R),{t, R" ,a), {s' , R')) G S iff i? ^* R" ^ R' and 
{s + t) s'. Notice that such a f is always a natural number. 

Bouyer et al. [5] showed that the corner-point abstraction is sufficient for deciding 
one-player average-price problem if the initial state is a corner-state, i.e., a state 
whose clock valuation is a corner. 

In this section we introduce the boundary region graphs which is a generalisa- 
tion of the corner-point abstraction. We prove that the value of the average-time 
game on a timed automaton is equal to the value of the average-time game on 
the corresponding boundary region graph, for all starting states, not just for 
corner states. In the process, we introduce two other refinements of the region 



automaton, which we caU the closed region graph and the region graph. We collec- 
tively refer to these three graphs as region graphs. The analysis of average-time 
games on those objects allows us to establish equivalence of average-time games 
on the original timed automaton and the boundary region graph. We also show 
(Lemma [T]) that the value of an average-time game is constant over a region. 
A side-effect of this result is that the corner-point abstractions can be used to 
solve average-time games on timed automata for arbitrary starting states. 

4.1 Region Graphs 

A configuration in region graphs is a is a pair {s, R), where s G Q is a configu- 
ration of the timed automaton and i? e 7?, is a region; We write f2 for the set of 
configurations of the region graphs. For a set X C J7 and a region _Ro G T^, we 
define the set X restricted to the region Rq as the set {{s,R) ^ X : R — Rq}, 
and we denote this set by X{Ro). For a configuration q ~ {s,R) G i? we write 
write [q] for its region R. 

Definition 1 (Closed Region Graph). The closed region graph T ~ {S,E) 
of a timed automaton T is a labelled transition system, where: 

— S is the set of states defined as 

S = {(s, R) e Q : se clos{R)} and 

— E is the labelled transition relation defined as 

E = {((s, R), {t, R", a), (s', R')) G x (M© x 7^ x yl) x 

: R — >* R" R' and s' = succ{s, {t, aj) and s + t E clos{R")}. 

Definition 2 (Boundary Region Graph). The boundary region graph T = 
(S, E) of a timed automaton T is a labelled transition system, where: 

— S is the set of states defined as 

S = {{s, R) e [2 : s e clos{R)} and 

— E is the labelled transition relation defined as 

E = {((s, i?), (i, R", a), (s', R')) G 5 X (Re X 7^ X A) X 

: R R" R' and s' = succ{s, {t, a)) and s + te bd{R")}. 

Boundary region graphs have the following remarkable property. 

Proposition 1 ([17J). For every configuration in a boundary region graph the 
set of reachable configurations is finite. 

We say that a configuration q — {s — {£, v), R) is corner configuration if v is 
a corner. 



Proposition 2. The reachable sub-graph of the a boundary region graph T from 
a corner configuration is same as the corner-point abstraction CP{T). 

Definition 3 (Region Graph) . A region graph of a timed automaton T is a 
labelled transition system T = {S,E), where: 

— S is the set of states defined as 

S= {{s,R) G Q : s€R} and 

— E is the labeUed transition relation defined as 

E = {{{s, R), {t, R", a), (s', R')) e 5 x (M® x 7^ x A) x 5 

: R ^* R" R' and s' = succ{s, {t, a)) and s + t G R"}. 

For configuration q = (s, R) e i7, real number t G M®, region R" G TZ, and 
action a £ A, we write succ(g, {t, R", a)) for the configuration (succ(s, {t, a)), R') 
where R" R' . 

4.2 Region Game Graphs 

For r = (T, iMin, -t-Max) wc define the sets Ouin = {{s,R) G f2 : R G T^Min} 
and Ouax = f^uin- Similarly we define sets 5Min, Suax, Suin, Smdx, Suin, 
and 5Max- The timed game automaton 7^ naturally gives rise to the closed re- 
gion game graph F — {T, Suin, Suax), the boundary region game graph F = 
(T, SMinjS'Max), and the region game graph F = (T, ^Min, ^Max). When it is 
clear from context, we use the terms region graphs and region game graphs in- 
terchangeably. Also, sometimes, we write T, T, T, and T for F, F, F, and F, 
respectively. 

4.3 Runs of Region Graphs 

An infinite run of the closed region graph T = {S, E) is an infinite sequence 

(go,n,gi,Ti,...) e5x ((Ke x 7^ x A) xSy, 

such that for every positive integer i we have {qi-i,Ti,qi) G E. A finite run of 
the closed region graph T is a finite sequence 

{qo,Ti,qi,Ti,...,qn) G S x ((M© x U x A) x S)* , 

such that for every positive integer i < n we have Tj, qi) G E. Runs of the 

boundary region graph and^the region graph are defined analogously. 

For a graph Q G {T, T, T} we write Runs^ for the set of its runs and Runs^ (q) 
for the set of its runs from a state q G Q. We write Runsgj^ for the set of finite 
runs and Runsgj^(g) for the set of finite runs starting from q G S. 



4.4 Pre-Runs and Run Types 



Pre-runs [T^ generalise runs of T, T, and T, and allow us to compare the runs in 
T, T, and T in a uniform manner. On the other hand, the concept of the type |12j 
of a run allows us to compare pre-runs passing through the same sequence of 
regions. 

A pre-run is a sequence {{sq, Rq), {ti,R[,ai), (si, i?i), . . .) e i7 x ((M® x 7^ x 

A) X J?)'^, such that s^+i = succ(si, (ti+i,ai+i)) and Ri — R'^+i °'^^> for 
every i G N. We write PreRuns for the set of pre-runs and PreRuns(s, R) for the 
set of pre-runs starting from (s, R) € fi. The relation between various sets of 
runs is as follows: for all g £ Q we have 

Runs"^((7) C Runs'^((7) C PreRuns((7) and 
Runs^(g) C Runs^((7) C PreRuns(g). 

A finite pre-run is a finite sequence ((sq, i?o)j (^ii ^'ij oi), . . . , (s„, Rn)) € (Qx 
TV) X ((M® X 7?. X A) X (Q X 7?.))* such that for every nonnegative integer i < n 
we have that s^+i = succ(si, (ij+i, aj+i)) and Ri R[ '^^ > Ri. We write 
PreRunSfin for the set of finite pre-runs and PreRunSfin(s, i?) for the set of finite 
pre-runs starting from (s, R) 6 Q. For finite run r — (go, (ii, oi), 9i, • ■ • , f?™) € 
PreRunSfin we define its total time as time(r) = ^"^^ we denote the last 

state of the run by last(r) — Qn- 

A run type is a sequence (i?o, (-R'l, ai), -Ri, (^2: "^a), • • •) e 7?. x ((7^ x A) x 
7?.)'^ such that for every i e N we have that Ri R'i+i — * Ri+i- We say 
that a pre-run r = ((sq, i?o), (^i, -R'l, ai), (si, 7?i), (ti, -R2' '^a), ■ • •) is of the type 
(i?o, (i?i,ai),i?i, (i?2.a2), . . .}. We say that a run r = (sq, (ti,ai),si, (^2,02), ■ ■ •) 
of a timed automaton T is of the type {Ro,{R'i,ai),Ri,{R2,a2), ■ ■ ■), where 
Ri = [si] and -R^^i = [si + ti+i] for all i £ N. We also define the type of a finite 
runs analogously. 

For a (finite or infinite) run or pre-run r, we write |r]7j for its type. We write 
Types for the set of run types, and we write Types(i?) for the set of run types 
starting from region R € TZ. Similarly we write Typesgjj for the set of finite 
run types, and we write TypeSfijj(i?) for the set of finite run types starting from 
region R G TZ. 

5 Strategies in Region Graphs 

In this section we define strategies of players in region graphs T, T, and T, and 
study some of their properties. Strategies in T are called admissible strategies, 
while strategies in T are called boundary strategies. We also introduce so-called 
type-preserving boundary strategies which are a key tool in proving the cor- 
rectness of game reduction from timed automata to boundary region graph. In 
Section [6] we show that there are optimal type-preserving boundary strategies in 
T and T. 



5.1 Pre-strategies and Strategies in T, 7", 7" 

Pre-strategics generalise the concept of strategies in region graphs, and allows us 
to discuss the strategies in T, T, and T in a uniform manner. We first define pre- 
strategies for players in T, and then using that we define strategies for players 
in closed region graph, boundary region graph, and region graph. 

Definition 4 (Pre-strategies). A pre-strategy of player Min /i is a (partial) 
function fj, : PreRunsfin Ke 'x x A, such that for a run r £ PreRunsj,.,,. 
if last{r) = {s,R) G fiMm then /i(r) — {t,R",a) is defined, and it is such that 
R — »■* R" ^ R' for some R' S TZ. Pre-strategies of player Max are defined 
analogously. We write ^Min ^''^'^ ^Max /'"^ of pre-strategies of player Min 

and player Max, respectively. 

We say that a strategy of player Min /i G -^Mhi positional if for all runs 
ri,r2 G PreRunsfin we have that last(ri) = Iast(r2) implies /x(ri) = /(x(r2). Simi- 
larly we define positional strategy of player Max. 

We define the run starting from configuration q G f2 where player Min and 
player Max use the strategies fj, G ^Min X G ^m^' respectively, in a straight- 
forward manner and we write run(g', /j., x) for this run. For every positive integer 
n we write run„((7, fi, x) for the prefix of the run run((7, fi, x) of length n. 

Now we are in a position to introduce strategies in closed region graph, region 
graph, and boundary region graph. 

Definition 5 (Strategies in Closed Region Graph). A pre-strategy of player 

Min fi G ^Min strategy in a closed region graph T = {S,E) if for every run 
r G PreRunsfin such that iJt{r) = {t,R',a), we have that (s + t) G clos(R') where 
{s,R) = last{r). Strategies of player Max in a closed region graph are defined 
analogously. We write S Min and Smux for the set of strategies of player Min 

and player Max, respectively. 

Definition 6 (Strategies in Region Graphs). A pre-strategy of player Min 
jjL G S'^M^^ is a strategy in a region graph T = (S, E) if for every run r G 

Run.sji.^^ such that /i(r) = {t,R",a), we have that {s -\- 1) G R" where (s,i?) = 
last{r) . Strategies of player Max in a region graph are defined analogously. We 
call such strategies admissible strategies. We write SMin and I^Max for the set 
of admissible strategies of player Min and player Max, respectively. 

Definition 7 (Strategies in Boundary Region Graph). A pre-strategy of 
player Min /i G ^'miu strategy in a boundary region graph T = [S, E) if for 
every run r G PreRunsfin such that ii{r) = (t, R' ,a), we have that 

t = mi{t : s + tG clos{R')}, (1) 

where (s,i?) = last{r). 

A pre-strategy of player Max x G ^mIx " strategy in a boundary region graph 
T if for every run r G PreRunsfin such that n{r) = {t,R',a), we have that 



t = sup{i : s-\-tG clos{R')}, 



(2) 



where {s,R) = last{r). We call such strategies boundary strategies. We write 
SMin and SMax for the set of boundary strategies of player Min and player Max, 
respectively. 

For notational convenience and w.l.o.g., in the definition of boundary strate- 
gies, we do not consider those timed moves of player Min (Max) which suggest 
waiting till the farther (nearer) boundary of a thick region. 

Remark 1. For every state s £ S of timed automata T and every strategy /U e 
^Min ^^'^ X € ^Ma.^ respective players, we have that : 

- run((s, [s]),/i,x) e Runs^(s, [s]) if /i e Z'Min and x € i^Max; 

- run((s, [s]),/i,x) S Runs^(s, [s]) if /x e i^Min and x e ^Max; 

- run((s, [s]),/x,x) e Runs^(s, [s]) if /U G ^Min and x G i^'Max- 

Boundary Strategies and Boundary Timed Actions. Define the finite set 

of boundary timed actions A = dfcDisj x C x A. For s E Q and a = {b, c, a) G A, 
we define t{s,a) = b — s{c) if s(c) < b, and t{s,a) = if s(c) > b; and we 
define succ(s,a) to be the state s' = succ(s, T(a)), where T{a) = {t{s,a),a); 

we then write s s'. We also write s s' if s ^l^i^ s'. For configuration 

q = (s, R) G fi, boimdary timed action a = (b, c, a) G A, and region R" G TZ we 
write succ(<7, {a, R")) for the configuration succ{q, {t{s, a), R" , aj). 

Timed actions suggested by a boundary strategies are precisely boundary 
timed actions. Tlic following proposition formalises this notion. 

Proposition 3. For every boundary strategy a G SMmi^Max) of player Min 
(Max) and for every run r G PreRunSfin, if o'{r) = {t,R',a) then there exists a 
boundary timed action a = (6, c, a) G A such that t{s, a) = t, where (s, R) = 
last{r). 

Proof. Let run r G PreRunsfin be such that last(r) = {s,R). Let a G i^Min be a 
boundary strategy of player Min such that a{r) = {t, R', a). From the definition 
of the boundary strategies, we have that t = m{{t : s + t G clos(ii')}. To prove 
the proposition, all we need to show is that there exists an integer 6 G Z and a 
clock c G C, such that b — s(c) = t. 

If R' G 7?.Thin then there exists a clock c' £ C such that for all states 
s' G clos(i?') we have that ls'{c')^ = 0. In this case the clock c = c' and the 
integer b= {s + t) (c) . 

If i? G 7?.Thick and let R' R be the thin region immediately before R. Let 
clock c' G C be such that for all states s' G clos(i?') we have that ls'{c')\ = 0. 
Again, in this case the desired clock c = c' and the integer b= {s + t) (c) . 

The case, where cr is a strategy of Max is similar, and hence omitted. □ 

Sometimes, in our proofs we need to use boundary timed action suggested by 
a boimdary strategy. For this purpose wc define the notation ff{r) that gives the 
boundary timed action and region pair that corresponds to a{r). The definition 
of this function is formalised in the following definition. 



Definition 8. For a boundary strategy a € ^Maxi^Max) of player Min (Max), 
we define the function a : PreRunsfin — > (A x TV) as follows: if for a run r G 
PreRunsfin we have a(r) = (i, R' , a), then 'cf{r) = {{b, c, a), R') such that 6— s(c) = 
t, where {s,R) — last{r). 

5.2 Type-Preserving Boundary Strategies 

We now introduce an important class of boundary strategies called type-preserving 
boundary strategies. Broadly speaking, these strategies suggest to players a unique 
boundary timed action and region pair for all the finite runs of the same type. 

Definition 9 (Type-Preserving Boundary Strategies). A boundary strat- 
egy a e Smiu of player Min is type-preserving if IrJ^ = [[?'2l7Z implies a{ri) = 
^{^2) for all ri,r2 € PreRunSfin- Type-preserving boundary strategies of player 
Max are defined analogously. We write Smiu o,nd SMax for the sets of type- 
preserving boundary strategies of players Min and Max, respectively. 

The rationale behind the name type-preserving is that if /i £ E^Mi-a and x G "Max, 
then for every R € TZ and for q, q' G ^(-R), the run types of the resulting runs 
from q and q' are the same, i.e., lwn{q, ij,,x)jn = [run(q', /z, x)!?^- 

Simple Functions. Let X C H. A function F : Q ^ R is simple [311 Ij if 
either: there is e G Z, such that for every q = (s, R) G X, we have F{q) = e; 
or there are e G Z and c G C, such that for every q = {s,R) G X, we have 
F{q) = e — s(c). We say that a function F : X ^ is regionally simple or 
regionally constant, respectively, if for every region R G TZ, the function F, over 
domain X{R), is simple or constant, respectively. 

For regions R, R' , R" G TZ and boundary timed action a = {b, c, a) G A, we 

write R >a R' if one of the following holds: 

— R -^b,c R" i?', or 

— there is region R'" G 7?.Thin such that R ^b,c R'" — >+i R" — * -R', or 

— there is a region R'" G 7?.Thin such that R — >b,c R'" <— +1 R" R' ■ 

Properties of Type-preserving Boundary Strategies. The next two propo- 
sition state that if both players play with type-preserving boundary strategies 
then for every n G N the total time spent in n transitions is regionally simple 
(Proposition |4]), and the average time of the infinite nm is regionally constant 
(Proposition [S]) . 

Proposition 4 (Type-preserving strategy pairs yield regionally simple 
time for finite runs), ///i G E^Min, X G ^Maxi and n G N, then time{runn{-, /i, x)) : 

Q — *■ is regionally simple. 

Proposition 5 (Type-preserving strategy pairs yield regionally con- 
stant average tirne). If ^ e Smih and x £ ^Max then -4Min(-,M, x) ■ Q 
and Amaxi:, fJ-^x) ■ Q ^ are regionally constant. 



Type-preserving Boundary Strategy that Agrees with a Boundary 
Strategy. Given an arbitrary boundary strategy a and a configuration q G Q, 
sometimes we are interested in a type-preserving boundary strategy that agrees 
with a for all the runs starting from q. We denote such a strategy by cr-^*. The 
following definition formalises such strategy 

Definition 10. For a boundary strategy /i G ^Min of player Min and q Cz Q we 

define ii^'^ G ^Min to be a type-preserving boundary strategy which satisfy the 
following conditions: 

1. iJL^i{r) = fiir) for every r G PreRunsfiniq), and 

^- Wt?. = I^'Itc implies /^^^(r) = ii^i{r') for all runs r,r' G PreRunSfin- 
For X G ^Max and q G Q we define x^"^ G — Mas analogously. 

Given an arbitrary strategy /i G -S'Min of player Min, a type- preserving bound- 
ary strategy x S S'Max of player Max, and a configuration q (z Q sometimes we 
require to specify a type-preserving strategy /i^'^'X) G S'Min which has the prop- 
erty that types of runs run(q, /i, x) and run(g, /i*^''^) , x) are the same. We then 
argue that from configuration g G Q if player Max plays according to x G ^Max 
then player Min can achieve better average-time if she plays according to /x*^*'^^ 
(see Proposition [S] and Corollary [T|) . The motivation for the definition of x'"'''^'' 
is similar. 

Definition 11. For an arbitrary strategy ii G S Min of player Min, a type- 
preserving boundary strategy x G ^Max of player Max, and a configuration 
q — (s, R) G Q, we define /i'^'^^ G E^Min to be a type-preserving boundary strategy 
which satisfy the following conditions: 

1. |run(g, , x)ln = {runisi: A^, x)lR^nd 

2. Irln = It'Itz implies /i(9^x)(r) = ^(9.x)(r') for all runs r,r' G PreRunsfin- 

For X e ^Max, fJ- G ^Min, and q e Q the strategy x*-''^'' £ ^Max is defined 
analogously. 

The following proposition and its corollary shows that starting from a con- 
figuration q player Min (Max) prefers /i^"?'^) (x^'^''^'') to fj, (x) against a type- 
preserving strategy x 6 ^Max (m G ^Min) of its opponent. 

Proposition 6. For every x £ ^Max, fJ- G ^ Min and q E Q we have that 
time{runn{q,ti,x)) > time{runn{q, n^'^'^Kx)), 

for every n G N. Similarly, for every /i G E^Min, X G ^Max and q E Q we have 
that 

time{runn{q,n,x)) < time{runn{q, fJ,,x^'^'''^)), 

for every n G N. 

An easy corollary of this proposition is as follows: 



Corollary 1. For every x G ^Max, M ^ ^Min and for all configurations q G Q 
we have that 

Similarly for every ii £ E^Min, X G ^ Max and for all configurations q G Q we 
have that 

AMax{q,l^,X)) < AMax{q,li,X''^'^^))- 

Admissible Strategies e-Close to a Type-Preserving Boundary Strat- 
egy. Given a type-preserving boundary strategy a and a positive real e > 0, 
sometimes we are interested in admissible strategies that behave like a within e 
precision. The following definition formalises such strategy. 

Definition 12. For fi e ^Mm and a real number s > 0, we define the set of 
admissible strategy -S'^in'' ^ ^Min as follows. For every fj,^ £ ^mI^ have that 
for all runs r £ PreRunSfin m(^) = ((^j c, a), i?') then /ie(r) = (t,R',a) is such 
that 

s + t £ R' and t <b - s(c) + e, 

where {s,R) = last{r). Notice that (see EquationU^ such a value oft always 
exists. ^ ^ 

Similarly for x G ^Max and a real number e > we define the set s''^'^^ C Emux 
as follows. For every Xs S ^Max have that for all runs r £ PreRunsfin if 
x{f) = {{h,c^a), R') then Xe{^) — {iiR'iO) is such that 

s + t £ R' and t >b — s(c) — e, 

where (s,i?) — last{r). 

Given an arbitrary strategy /i £ Smui of player Min, a positive real e > 0, a 
type-preserving boundary strategy x G •^Max of player Max, an e-close strategy 
Xe S ^Max^ I ^"^d a configuration q £ Q sometimes we require to specify a type- 
preserving strategy fj.^'''^^'^ £ ^Min which has the property that types of runs 
run(g,^,Xe) and run{q, fj,^'i-^^\xe) are the same. 

Definition 13. For an arbitrary strategy /i £ S Min of player Min, a positive 
real e > 0, a type-preserving boundary strategy x £ ^Max of player Max, an 
e-close strategy Xe G ^'uax ' ^'^'^ ^ configuration q — {s,R) £ Q, we define 
to be a type-preserving boundary strategy which satisfy the fol- 
lowing conditions: 

1. lrun{q,^x^'i-'^'-\xe)\TZ =Jnm{q, ^i,Xe)lR^and 

2. \r\n = \r'\'R. implies fi'-i^^^') (r) ~ /i^'^'-x^) (r') for all runs r,r' £ PreRunsfin. 

Combining it with Definition\12\we get that |run(g, fj,^'^-'^^\ x)1k = Xe)lK 

For X € SMax, Xe £ ^MaJ , t^ S Smiu, and q £ Q the strategy x^'^'^''' £ ^Max is 
defined analogously. 

We need the following property of n^i'^^^ and x''-'^''^^^ strategies. 



Proposition 7. For every arbitrary strategy /i G S Min, positive real e > 0, 
type-preserving boundary strategy x G ^Max of player Max, e-close strategy Xe G 
^Max '^f plo,ysi~ Max, and q E Q we have 

time{runn{q,n,Xe)) > time{run,,{q, fi''''-^'-\x)) -n-e, 

for every n G N. Similarly for every arbitrary strategy x G ^Max, positive real 
£ > 0, type-preserving boundary strategy fi G E^Min of player Max, e-close strategy 
He G ^Min '^f P^o^V^f Min, and q Q we have 

time{runn{q,He,x)) < t/'me(run„(g, x^''""'^)) + n • £, 

for every n G N. 

The following result is an easy corollary of Proposition [71 

Corollary 2. For every x e ^Max, M ^ ^Mm, £ > 0, Xe e ^MaJ ' Q ^ Q 
we have that 

AMax{q,l^,Xe)) > AMax{q,lJ''''^''^'\x)) " 

Similarly for every fi G ^Min, X G ^Max, £ > 0, G ^Mm' "'^^ q ^ Q we have 
that 

AMzn{q, Me, x) < Ammiq, x*-''^'-') + £• 

To summarise the relations between various strategies, note that the following 
inclusions hold: 

^Min C ^Min C Suin C ^^in ™^ ^Min C Z'Min C i7^Yn' ^'^'^ 
"Max ^ 'S'Max ^ 'S'Max ^ -^Max 'S'Max ^ 'S'Max ^ -E'^'^^. 

6 Average-Time Games on Region Graphs 

We define ^Min : ^ x IJ^°^ x S^^^ ^ and ^Max : f2 x S^^ x i;P''^^^ ^ 
in the following manner: 

AMin{q,l^,x) = limsup- •time(run„((7,^,x)), and 

n— i-oc 

^Max(9,Ai,x) = liminf i • time(run„(g, ^, x)), 

n — »oo fi 

where fi G ^Min' X ^ -^Max and q <E Q. For average-time games on a graph 

^ ~ „ (J 

5 G {T, T, T} we define the lower- value val {q), the upper- value val [q) and the 

value val^(g) of a configuration g G Q in a straightforward manner. 

From construction it clear that the difference between an average-time game 
on a timed automaton and the average-time game on corresponding region graph 
is purely syntactical. Hence if the average-time game on region graph T is de- 
termined then average-time game on timed automaton T is determined as well. 



Proposition 8. An average-time game on timed automaton T is determined, if 
the corresponding average-time game on region graph T is determined. Moreover 
for all s S we have that val{s) =^ vaF (s, [s]). 

The following is the main result of this section. 

Theorem 3. Let T be a timed automaton. Average-time games on the timed 
automaton T, the closed region graph T , the region graph T , and the boundary 
region graph T are determined. Moreover for every s € S in a timed automa- 
ton T , we have: 



This theorem follows from Theorem 21 Theorem [SJ Theorem [71 and Proposi- 
tion [i 

Moreover Theorem [3] and Proposition [5] let us conclude the following lemma 
about the value of average-time games on timed automata. 

Lemma 1. The value of every average-time game is regionally constant. 

An interesting implication of Lemma [1] is that corner-point abstraction is 
sufficient to solve average-time games with an arbitrary initial state. 

6.1 Determinacy of Average-Time Games on the Boundary Region 
Graph 

Positional determinacy of average-time games on the boundary region graph is 
immediate from Proposition [1] and Theorem [1] 

Theorem 4. The average-time game on T is determined, and there are optimal 
positional strategies in T , i.e., for every q G Q, we have: 

vaF{q)= inf sup y4,Mtn(g, A^, x) = sup inf AMax{q, IJ;X)- 

In fact, in a boundary region graph, there are optimal type-preserving bound- 
ary strategies. Before we show that, we need the following result. 

Lemma 2. In T, i/ /i G ^Min and x G ^Max are mutual best responses from 
q G Q, then fx^"^ S SMin and x^"^ £ ^Max are mutual best responses from every 



Auiniq', ll^\ X^') = ^Mi„(g, l^^\X^') > Auiniq, f^^' , X^^') = 



The first equality follows from Proposition [51 the inequality follows because x 
is a best response to /i from q; the second equality follows from Proposition [5l 



vaF{s) 



vaF{s,[s]) = var{s,[s]) 



var{s,[s]). 




again; and the last equality is straightforward. 



□ 



Theorem 5. There are optimal type-preserving boundary strategies in T , i.e., 
for every q <E Q, we have: 

vaF{q)= inf sup Aminiq, ^^,x) = sup inf Auaxiq, ^J.,x)■ 
Proof. Let /i* e E^Min and x* G —Max be mutual best responses in T; existence 
of such strategies follows from Lemma [2J Moreover, we can assume that the 
strategies /i* and x* have finite memory; this can be achieved by taking positional 
strategies G ^Min and x £ ^Max in Lemma We then have the following: 

inf sup Auiniq, IJ; X) < sup ^Minl*?, = -^Min(g,A^*,X*) = 

^Max('?,A^*,X*) = inf ^Max(g,/^,X*) < sup inf ^Max(g,M,x)- 

The first and last inequalities are straightforward as fi* G E^Min and x* G S'Max- 
The first equality holds because x* is a best response to fi* in T, and the third 
equality holds because /i* is a best response to x* in T. Finally, the second 
equality holds because strategies /i* and x* have finite memory. □ 

6.2 Determinacy of Average-Time Games on the Closed Region 
Graph 

To be able to show the determinacy of the average-time games on the closed 
region graph, we need the following intermediate result. 

Lemma 3. In T, for every strategy in there is a best response in ^Maxi 

and for every strategy in Smux there is a best response in SMin- 

Proof. We argue that if G ^Min is best-response to x £ —Max from q £ Q 
then the strategy /i^'^''^^ is best-response to x from every q' G Q{[q]). For all 
M G Smin we have the following: 

Auiniq', M^'^^^ , X) = Auiniq, t^^"^""^ , X) < Auiniq, X) < Auiniq, M('?'''^) , x) = 

^Min(g',M(«''X),X) < AMin{q',fi',x)- 

The first and the second equalities follow from Proposition the second in- 
equality follows because /z is a best response to x from g; and the first and the 
third inequalities follow from the the Corollary [TJ It follows that in T for every 
strategy x G S'Max there is a best response in S'Min- Similarly we prove that in 
T for every strategy ^ G Suin there is a best response in ^E'lviax- □ 

Theorem 6. The average-time game on T is determined, and there are optimal 
type-preserving boundary strategies in T , i.e., for every q £ Q, we have: 

vaF{q)^ inf sup ^Mml?, A^, x) = sup inf Amaxiq, IJ^, x) ^ ^aF (q). 

^ ^ " X e ^ Max X e Ha; ai e S Mm 



Proof. We have the following: 



inf sup AMin{q,fJ;X) = inf sup Auiniq, ^J',X) = 

sup inf ^Max('?,M:X) = sup inf y4.Max('?, M, x), 

where the first and last equalities follow from Lemma [31 and the second equality 
follows from Theorem [51 _ 

Now we show that val^(q) > val^(q). The proof that val (q) < val (q) is 
similar and hence omitted. 

yai^(g) = sup inf AuiiAq, x) > sup inf ^Maxlg, x) 

Xel^Max MSl^Min X^-Max M^i^Min 

= sup inf ^Max(g, M.x) = yal^(g). 

XeSMax /'E-Min 

The first inequality follows as S'Max ^ ^Max- The first equality holds by defini- 
tion, the second equality is proved in the first paragraph of this proof, and the 
third equality follows from Theorem [5l From Lemma [H we know that val^(q) = 

val (q). It follows that the average-time game on T is determined, and there are 
optimal type-preserving boundary strategies in T. □ 



6.3 Determinacy of Average-Time Games on the Region Graph 

Lemma 4. If the strategies fi* G E^Min andx* G ^Max o-fs optimal for respective 
players in T then for every e > 0, we have that 

sup AMm{q,lJ-t,x)<v3F{q)+e and inf AMax{q, H,xt) > ^^1^ (q) ~ 
for allyil^S^^^'^ andxl^S^£f. 

Proof. Let /i* G Eyiin and x* £ S'Max are optimal for respective players in T. 
For all X ^ ^Max, e > 0, and ^* e we have the following: 

-4Min(<7, mJ,x) < -4Min(q,Ai*,X^'''''"'')+£ < ^Min(g,Ai*,X*)+£ = val^((7)+e. 

The first inequality is by Corollary [2l The second inequality holds because x* 
is an optimal strategy and the equality is due to the fact that ji* and x* are 
optimal. □ 

Theorem 7. The average-time game on T is determined, and for every q & Q, 
we have vaF{q) — vaF [q). 



Proof. Let /i* G ^Min be an optimal strategy of player Min in T. Let us fix an 

^Mii 



e > and /i* G S^^' 



T 

val (q) = inf sup ^Min(g, A^, x) < sup Auiniq^ f^liX) < 

sup y^Minlg, MeiX) < val^(g) +e. 

The second inequality follows because fi* G Smin and the third inequality follows 
as Syiax Q ^Max- The last inequality follows from Lemma [4] because /i* G Suin 
is an optimal strategy in T. Similarly we show that for every e > we have that 

ya\^{q) > val'^(g) — e. Hence it follows that val'^(g) exists and its value is equal 
to val^(g). □ 



7 Complexity 

The main decision problem for average-time game is as follows: given an average- 
time game r = (T, LMin, ^Max), a state s G S*, and a number B G M®, decide 
whether val(s) < B. 

Theorem 8. Average-time games are EXPTIME-complete on timed automata 
with at least two clocks. 

Proof. From Theorem [3] we know that in order to solve an average-time game 
starting from an initial state of a timed automaton, it is sufficient to solve the 
average-time game on the set of states of the boundary region graph of the au- 
tomaton that are reachable from the initial state. Observe that every region, and 
hence also every configuration of the game, can be represented in space polyno- 
mial in the size of the encoding of the timed automaton and of the encoding of 
the initial state, and that every move of the game can be simulated in polynomial 
time. Therefore, the value of the game can be computed by a straightforward 
alternating PSPACE algorithm, and hence the problem is in EXPTIME because 
APSPACE = EXPTIME. 

In order to prove EXPTIME-hardness of solving average-time games on timed 
automata with two clocks, we reduce the EXPTIME-complete problem of solving 
countdown games ilOj to it. Let G — {N, M,TT,no, Bq) be a countdown game, 
where iV is a finite set of nodes, M C x is a set of moves, tt : M ^ N+ 
assigns a positive integer number to every move, and (no, Bq) G N x N+ is the 
initial configuration. 

W.l.o.g we assume that there is an integer W such that 7r(ni,n2) > W for 
every move (ni,n2) G M. {n,B) G A x N+, first player 1 chooses a number 
p G N+, such that p < B and n{n, n') — p for some move (n, n') G M, and then 
player 2 chooses a move (n, n") G Af, such that 7r(n, n") = p\ the new configura- 
tion is then (n", B — p). Player 1 wins a play of the game when a configuration 
(n,0) is reached, and he loses (i.e., player 2 wins) when a configuration {n,B) 



is reached in which player 1 is stuck, i.e., for all moves {n,n') G M, we have 
7r(n,n') > B. 

We define the timed automaton Tq = {L, C, S, A, E, 6, ^, F) by setting C = 
{b,c}; S = L X ((|BoDr)2; A = {*}UPUM, where P = 7r(M), the image of 
the function n : M ^ N+; 

L = {*}U N \J {{n,p) : there is (n, n') € s.t. 7r(n, n') = p}; 
{{n, u) : n & N and = Bq} if a = *, 

{(*, ly) : v{c) = W} if a = *, 

{ (n, z^) : n') E M, s.t. 7r(n, n') — p and v{c) = O} if a = p € P, 
{((n,p),j^) : n{n,n') ~ p a.nd iy{c) = p} if a = {n,n') £ M, 

* if £ = n G N and a = *, 

* if = * and a = *, 
(n, p) if £ = n £ N and a = p G P, 
n' if £ = {n,p) G N X P and a = (n, n') e M; 



i?(a) 



<5(^, a) = < 



^(a) = { c }, for every a G A \ { * } and ^(*) = { 6, c }. Note that the timed 
automaton Tq has only two clocks and that the clock b is reset only in the special 
location *. 

Finally, we define the average-time game on timed game automaton Pq = 
(Tg, Li, L2) by setting Li = N and L2 = i \ Li. It is routine to verify that 
value of the average-time game at the state (no, (0, 0)) G 5 is in the average- 
time game on Fq if and only if player 1 has a winning strategy (from the initial 
configuration (no, -Bo)) in the countdown game G. □ 
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A Proof of Proposition H] 



In order to prove this proposition, we need the following result. 

Proposition 9 ( [TTIfTTl ). Let a E A and regions R, R' , R" E TZ be such that 
R R'- If F : Q{R') R zs simple then F® : i7(i?) R, defined as 

(s, R) I— > t{s, a) + F{succ{q, {a, R"))), is simple. 

Proof (Proof of Proposition^. Let /i G ^Min and x G ^Mux- We prove this 
lemma by induction on the value of n. The base case for n = is trivial. Assume 
that for every € ^Min and x G •^Max the function time(runj;(-, ^, x)) ■ Q ^ Re 
is regionally simple. To prove this proposition we now need to show that for 
fj, € S'Min and x € S'Max the function time(runfe+i (•, /z, x)) is regionally simple. 

Let the strategies /i' S E^uin and x' £ •5'Max be such that for every q € Q 
the run runfc(succ(q, /i, be the length k suffix of runk+i{q, fJ-iX)- From 

inductive hypothesis we have that runk{-, fJ-' ,x') is regionally simple. Assume 
that R e T^Min and let /i(((z}) = {(^,R") for every q e Q{R). The treatment for 
the case where R E TZmux is similar. Now for every q = {s,R) E Q{R) we have 
that time(runfc+i(g, ^, x)) = t{s,a) + runfc(succ(q, (a, i?")), xOj which from 
Proposition [9] is a simple function. □ 



B Proof of Proposition [5] 

Proof. Let E Suin, X £ ■^Max and q = (s, R), q' — (s', R) E Q{R). We have 

Aminiq, x) - -4Min(g', x) 

= liminf(l/n) ■ time(run„((7, /x, x)) — liminf(l/n) • time(run„(q', /i, %)) 

n — >C30 n — >oo 

= liminf(l/n) • (6 - s(c) - fe + s'(c)) = liminf(l/n) • (s'(c) - s(c)) = 0. 

n — >oo n — 'oo 

The first equality is by definition, the second follows from Proposition |4l and the 
last two equalities are trivial. In a similar manner we show that .4Max('?, x) = 

^Max('?',M,X)- □ 



C Proof of Proposition [HI 

In order to prove this Proposition [5] and Proposition [71 we need the following 
result. 

Proposition 10 ( jlllllT] ). Let a E A and regions R,R',R" E TZ be such that 
R R" R'. If F : n{R') ^ R is simple then for every q = (s,i?) E n{R), 
function F^^ : / ^ R, defined as t i-^ t + F[succ{q, {t, R" , a))), is continuous 
and nondecreasing, where I — {t E R© : {s + t) E clos{R")}. 



Proof (Proof of Proposition\E^. The proof is by induction on n. The base case, 
when 71 = 0, is trivial. In the rest of the proof we show that for x G —Max, fJ- G 
-^Min, and a configuration q — {s,R) G Q, we have that time(runfe+i(g, /i, x)) > 
t\me{runk+i{q, fJ.'^'^'^^x)) assuming that the proposition holds for n — k. The 
proof for the case where q € Quax trivial. In the rest of the proof we assume 
that q G Qjviin- _ 

Let usfixx e ^Max and^ G ^Min- Let runk+i{q, fJ.,x) and runk+i{q, fi^'^'^\x) 
be (go, Ti, 91, • • ■ , Qk+i) and (g^, T[,q[, q'^+i), respectively, where qo ^ q'o = q- 
Notice that by definition the run types of both runs are the same. Hence for 
every index i < fc + 1 we have qi = {si, Ri) and q[ — {s[,Ri), and for every index 
i < fc + 1 we have — [ti,R[,ai) and t[ = {t[, R[, ai). 

Let X G H'lviax and M G Syvia be such that runfc(qi, M, X) be length k suffix 
of run/j+i(q, /i, x). Notice that we assume that X is type-preserving. It is easy to 
see that 

time(runfc+i((7, /X, x)) = ii + time(runfc((7i, M, X)). 
From inductive hypothesis, we get that 

time(runfc+i(g,^,x)) > ti + time(runfc(gi, M^^^'^^ X)). (3) 

Since the strategies M'^I''^^ G E^Min and X G S'Max are type-preserving, from 
Proposition [4] we get that time(runfc(-, Af*^"^!'"^), X)) is regionally simple. Let us 
denote the restriction of this function on domain Q{Ri) by T : Q{Ri) — > R. Let 
us define the partial function T®^ ^, : ^ K as t ^ t + jF(succ(q, [t, R" , a))), 
for all t G M®, such that (s + t) G clos(_R']^). The following inequality follows from 
®: 

time(runfc+i(g,^,x)) > + ^(gi) > inf {^(® W : s + t G clos(i?'i)}. 

Since /i*^'^-*^) is a type-preserving boundary strategy of player Min, from equation 
([T]), we know that t[ = mf{t : s -f i G clos(i?']^)}. Moreover from Proposi- 
tion [10] we have that ^, is continuous and nondecreasing on the domain 

{i G Me : + i) e clos(i?'')}. Hence = mf* {^(®,fli,a)(0 s + t€ 

clos(i?'j^)}. Combining these facts, we get the following inequalities: 

time(runfc+i(g,M,x)) >^(%,^,)(i'i) = <'i+time(runfc(g;,M(''i'^),X)) 

Since runk{q'i, M^'^^''^\ X) is length k suffix of runk+i{q, ^i^'^'^\x), we get the 
desired inequality. □ 

D Proof of Proposition [7] 

Proof. The proof is by induction on n. The base case, when n = 0, is trivial. 
In the rest of the proof we show that for x S S'Max, A* G ^Min, e > 0, Xe & 
-^Max ' ^'^^ ^ configuration q = (s, R) G Q, we have that time(runfc+i(g, ^, Xe)) > 
time(runfc+i(g, A*^'^'^^-', x)) — k-e, assuming that the proposition holds for n — k. 



Let us fix X e -Max, fJ- e Suin, £ > 0, and Xe e ^Max ■ Let runk+l{q,^i,Xs) 
and runi.+i(gf,^(«'X^),x) be (go,Ti,gi, • ■ ■,qk+i) and (q(,,r{,gi, . . ■,q'k,+i), respec- 
tively, where qo = — q- Notice that by definition the run types of both runs are 
the same. Hence for every index i < k+1 we have qi — (si, Ri) and q[ — {s[, Ri), 
and for every index i < k + 1 we have = [ti, R[, ai) and t[ = (t^, R[^ a^). 

Let X G ^Max and M e i^Min be such that runfe((7i, M, X^) be length k suffix 
of runfe_(.i(g, /i, Xe). Notice that we assume that X is type-preserving. It is easy 
to see that 

time(runfe+i(g, ^,Xe)) = h time(runfe(gi, M, Xg)). 

From inductive hypothesis, we get that 

time(mnfc+i(g, /i, Xe)) > h + t\me{runk{qi, M^'^^^'K, X)) ~ k ■ e. (4) 

Since the strategies M^'i^^^^ G ^Min and X G S'Max are type-preserving bound- 
ary strategies, from Proposition |4] we get that time(runfc(-, M'^^^''^^^ X)) is re- 
gionally simple. Let us denote the restriction of this function on domain Q{Ri) 
by T : Q{Ri) K. Let us define the partial function JF® ^, : ^ M as 
t^t + T{succ{q, {t, R",a))), for aU t G M®: such that (s + t) e clos(i?;). The 
following inequality follows from Q: 

time(runfc+i((7,^,Xe)) > ti+T{qi)-k-e > inf {j?^^® : s+t G c\os{R[)} ~k-e. 

We need to consider two cases: q G Q-yiin and g G Qjviax- 

— Assume that q G QMin- Since fi^i-'^^^ is a type-preserving boundary strategy 
of player Min, from equation ([T]), we know that t[ = mf{t : s + t & c\os{R[)}. 
Moreover from Proposition (TU] we have that ^, is continuous and non- 
decreasing on the domain {t G Kg : (s + <) G clos(i?")}. Hence JF® ^, (i'j^) = 
inft {^(® : s + t G clos(i?'i)}. Comb ining these facts, we get the fol- 
lowing inequalities: 

time(mnfe+i(g,^,X£)) > t'l + time(runfc(q'i, Af^'i'^^) , X)) - fc • e. 

Since runk{q'i, M^'i^''^^\ X) is length fc suffix of runfc+i(g, ^'"^^^^^ x), we get 
the following inequality: 

time(runfc+i(g,/i,Xe)) > time(runfe+i(g, ^('^i''^^), x)) -k-e 

> t\me{runk+i{q,fi^'^''^^\x)) - (fc + 1) • e, 

as required. 

— Assume that q G QMax- So far we have shown that 

time(runfe+i {q, fi, Xe)) > ti + ^{qi) -k-e. (5) 



Since is a simple function let J-{{si,Ri)) — h — si(c) for all (si,i?i) G 
Q{Ri). For all t G such that s + t G i?']^ we have the following observation. 

, , \t + b ifcGffai) , , 

t + T{{succ{s,{t,a,))) = { ^) (6) 

I — s[c) otherwise. 

By Definition [12] we know that ti > t[ — e. Combining this with ([6]) we get 
that 

ti+T{qi) ■>t'^+T{q'^)-e. 
We can then rewrite ([5]) as the following: 

time(rurn;+i {q, fi,Xe))> A + -^(^i) - (fc + 1) • £■ 

The term J-{q'i) represents the sum of the times of runk{q[, M'-'^'^''^'\ X). 
Since runj:{q[, M^'^^''^^\ X) is length k suffix of run/j+i (g, x), we get 

the inequality 

time(runfc+i(q,/^,X£)) > time(runfe+i((7, x)) - (fc + 1) • e, 
as required. 

□ 



